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Abstract 

GENIE [H is a new neutrino event generator for the experimental neutrino physics community. 
The goal of the project is to develop a 'canonical' neutrino interaction physics Monte Carlo 
whose validity extends to all nuclear targets and neutrino flavors from MeV to PeV energy 
scales. Currently, emphasis is on the few-GeV energy range, the challenging boundary between 
the non-perturbative and perturbative regimes, which is relevant for the current and near future 
long-bascline precision neutrino experiments using accelerator-made beams. The design of the 
package addresses many challenges unique to neutrino simulations and supports the full life-cycle 
of simulation and generator-related analysis tasks. 

GENIE is a large-scale software system, consisting of ^120,000 lines of C-I--I- code, featuring a 
modern object-oriented design and extensively validated physics content. The first official physics 
release of GENIE was made available in August 2007, and at the time of the writing of this article, 
the latest available version was v2.4.4. 
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1. Introduction 



Over the last few years, throughout the field of high energy physics (HEP), we have witnessed 
an enormous effort committed to migrating many popular procedural Monte Carlo Generators 
into their C++ equivalents designed using object-oriented methodologies. Well-known examples 
are the GEANT Q, HERWIG and PYTHIA [| Monte Carlo Generators. This reflects a 
radical change in our approach to scientific computing. Along with the eternal requirement that 
the modeled physics be correct and extensively validated with external data, the evolving na- 
ture of computing in HEP has introduced new requirements. These requirements relate to the 
way large HEP software systems are developed and maintained, by wide geographically-spread 



collaborations over a typical time-span of ~25 years during which they will undergo many (ini- 
tially unforeseen) extensions and modifications to accommodate new needs. This puts a stress on 
software qualities such as re-usability, maintainability, robustness, modularity and extensibility. 
Software engineering provides many well proven techniques to address these requirements and 
thereby improves the quality and lifetime of HEP software. In neutrino physics, the requirements 
for a new physics generator are more challenging for three reasons: the lack of a 'canonical' pro- 
cedural generator, theoretical and phenomenological challenges in modeling few-GcV neutrino 
interactions, and the rapidly evolving experimental and theoretical landscape. 

Neutrinos come from many sources and a variety of experiments have been mounted to measure 
their properties. These experiments have complicated detectors composed of many elements and 
the neutrinos have many flavors and a wide energy spectrum (from 1 MeV to 1 PeV). 
Our long-term goal is for GENIE to become a 'canonical' neutrino event generator with wide 
applicability. The origins of the code come from the Soudan experiment and recent application 
has been primarily to MINOS Thus, emphasis has been given to the few-GeV energy range, 
the challenging boundary between the non-perturbative and perturbative regimes. These are 
relevant for current and near future long-baseline precision neutrino oscillation experiments using 
accelerator-made beams, one of the focuses of high energy physics. GENIE development over the 
next five years will be driven by the upcomi ng g eneration of accelerator experiments including 
T2K 0, NoVA [i], Minerva MicroBooNE [iq and ArgoNEUT [uj. These developments are 



well underway and the code is being used successfully in each of these experiments. The present 
version provides comprehensive neutrino interaction modelling in the energy from ^^100 MeV to 
a few hundred GeV. Results can be obtained and will be qualitatively correct for any nuclear 
target. 

GENIE0 is a ROOT-based Neutrino MC Generator. It was designed using object-oriented 
methodologies and developed entirely in CH — h over a period of more than three years, from 2004 
to 2007. Its first official physics release (v2.0.0) was made available on August 2007 and, at 
the time of writing this article, the latest available version was v2.4.4. It also describes v2.6.0 
which will be released shortly. GENIE has already been adopted by the majority of neutrino 
experiments, including those using the JPARC and NuMI neutrino beamlines, and will be an 
important physics tool for the exploitation of the world accelerator neutrino program. 

The project is supported by a group of physicists from all major neutrino experiments operating 
in this energy range, establishing GENIE as a major HEP event generator collaboration. Many 
members of the GENIE collaboration have extensive experience in developing and maintaining 
the legacy Monte Carlo Generators that GENIE strives to replace, which guarantees knowledge 
exchange and continuation. The default set of physics models in GENIE have adiabatically 
evolved from those in the NEUGEN [3l package, which was used as the event generator by 
numerous experiments over the past decade. 

This article will discuss the paradigm shift brought about by GENIE in neutrino physics simu- 
lations. In Sec. [2] we describe the unique challenges facing neutrino simulations in more detail. 
Section [3] gives a brief overview of the physics models available in GENIE. Section 2] gives a brief 
discussion of upgrades in progress. Section [5] describes the object-oriented design of GENIE. 
Section [6] describes the GENIE applications and utilities available for simulation and analysis 
tasks. Section [7] describes the structure of the GENIE collaboration and Section [8] describes code 
availability, distribution, supported platforms, external dependencies, releases, and license. 



^ GENIE stands for Generates _Events for JVcutrino interaction _Experiments 



2. Neutrino Interaction Simulation: Challenges and Significance 

Neutrinos have played an important role in particle physics since their discovery half a century 
ago. They have been used to elucidate the structure of the electroweak symmetry groups, to 
illuminate the quark nature of hadrons, and to confirm our models of astrophysical phenomena. 
With the discovery of neutrino oscillations using atmospheric, solar, accelerator, and reactor neu- 
trinos, these elusive particles now take center stage as the objects of study themselves. Precision 
measurements of the lepton mixing matrix, the search for lepton charge-parity (CP) violation, 
and the determination of the neutrino masses and hierarchy will be major efforts in HEP for 
several decades. The cost of this next generation of experiments will be significant, typically tens 
to hundreds of millions of dollars. A comprehensive, thoroughly tested neutrino event generator 
package plays an important role in the design and execution of these experiments, since this tool 
is used to evaluate the feasibility of proposed projects and estimate their physics impact, make 
decisions about detector design and optimization, analyze the collected data samples, and eval- 
uate systematic errors. With the advent of high-intensity neutrino beams from proton colliders, 
we have entered a new era of high-statistics, precision neutrino experiments which will require a 
new level of accuracy in our knowledge, and simulation, of neutrino interaction physics |14l |. 

While object-oriented physics generators in other fields of high energy physics were evolved 
from well established legacy systems, in neutrino physics no such 'canonical' MC exists. Until 
quite recently, most neutrino experiments developed their own neutrino event generators. This 
was due partly to the wide variety of energies, nuclear targets, detectors, and physics topics 
being simulated. Without doubt these generators, the most commonly used of which have been 
GENEVE [H], NEUT [H, NeuGEN [3, NUANCE [l^ and NUX [3, played an important 
role in the design and exploration of the previous and current generation of accelerator neutrino 
experiments. Tuning on the basis of unpublished data from each group's own experiment has 
not been unusual making it virtually impossible to perform a global, independent evaluation for 
the state-of-the-art in neutrino interaction physics simulations. Moreover, limited manpower and 
the fragility of the overextended software architectures meant that many of these legacy physics 
generators were not keeping up with the latest theoretical ideas and experimental measurements. 
A more recent development in the field has been the direct involvement of theory groups in the 
development of neutrino event generators, such as the NuWRO [31 and GiBUU [lOl packages, 
and the inclusion of neutrino scattering in the long-established FLUKA hadron scattering package 

Simulating neutrino interactions in the energy range of interest to current and near-future ex- 
periments poses significant challenges. This broad energy range bridges the perturbative and 
non-perturbative pictures of the nucleon and a variety of scattering mechanisms are important. 
In many areas, including elementary cross sections, hadronization models, and nuclear physics, 
one is required to piece together models with different ranges of validity in order to generate 
events over all of the available phase space. This inevitably introduces challenges in merging and 
tuning models, making sure that double counting and discontinuities are avoided. In addition 
there are kinematic regimes which are outside the stated range of validity of all available models, 
in which case we are left with the challenge of developing our own models or deciding which model 
best extrapolates into this region. An additional fundamental problem in this energy range is a 
lack of data. Most simulations have been tuned to bubble chamber data taken in the 70's and 
80's. Because of the limited size of the data samples (important exclusive channels might only 
contain a handful of events), and the limited coverage in the space of {v/v^E^, A), substantial 
uncertainties exist in numerous aspects of the simulations. 

The use cases for GENIE are also informed by the experiences of the developers and users of 
the previous generation of procedural codes. Dealing with these substantial model uncertainties 
has been an important analysis challenge for many recent experiments. The impact of these 
uncertainties on physics analyses have been evaluated in detailed systematics studies and in 
some cases the models have been fit directly to experimental data to reduce systematics. These 



'downstream' simulation-related studies can often be among the most challenging and time- 
consuming in an analysis. 



To see the difficulties facing the current generation of neutrino experiments, one can look no 
further than the K2K and MiniBooNE experiments. Both of these experiments have measured a 
substantially different distribution for their quasielastic-like events when compared with their 
simulations, which involve a standard Fermi Gas model nuclear model 2^, 2^. The disagreement 



between nominal Monte Carlo and data is quite large - in the lowest bin of MiniBooNE the 
deficit in the data is around 30% [2^ . It seems likely that the discrepancies seen by both exper- 
iments have a common origin. However the two experiments have been able to obtain internal 
consistency with very different model changes - the K2K experiment does this by eliminating the 



charged current (CC) coherent contribution in the Monte Carlo [2J] and the MiniBooNE experi- 
ment does this by modifying certain parameters in their Fermi Gas model psj . Another example 
of the rapidly evolving nature of this field is the recently reported excess of low energy electron- 
like events by the MiniBooNE collaboration [2^. These discrepancies ha ve g enerated si gnifi cant 
new theoretical work on these topics over the past several years [2^ . 27. 28. 



The situation is bound to become even more interesting, and complicated, in the coming decade, 
as new high-statistics experiments begin taking data in this energy range. Designing a software 
system that can be responsive to this rapidly evolving experimental and theoretical landscape is 
a major challenge. 

In this paper we will describe the ways in which the GENIE neutrino event generator addresses 
these challenges. These solutions rely heavily on the power of modern software engineering, 
particularly the extensibility, modularity, and flexibility of object oriented design, as well as the 
combined expertise and experience of the collaboration with previous procedural codes. 



3. Neutrino Interaction Physics models in GENIE 

The set of physics models used in GENIE incorporates the dominant scattering mechanisms from 
several MeV to several hundred GeV and are appropriate for any neutrino flavor and target type. 
Over this energy range, many different physical processes are important. These physics models 
can be broadly categorized into nuclear physics models, cross section models, and hadronization 
models. 

The neutrino-nucleus interaction involves a large variety of processes, all of which must be mod- 
eled to get an accurate description of the experimental signature of any detector and its many 
components. Since most theoretical models describe a small subset of these processes, GENIE 
must include many models. The broad energy range and the many nuclei to be covered force 
choosing models that have very broad applicability. 

The particle in the nucleus with which the neutrino interacts depends strongly on the energy. 
At high energies {Ei, >10 GeV) neutrinos interact with a single quark inside a nucleon (neutron 
or proton); the code must model this interaction and the distribution of the residual quarks. At 
lower energies, the struck particles are neutrons and protons. The neutrino tends to strike a single 
nucleon (impulse approximation) which is affected by the nuclear medium in which it resides. In 
the high energy regime, the large body of neutrino-nucleon data is sufficient for development of 
a full model. At lower energies, neutrino-nucleon data has been used for the basic process and 
nuclear models developed for other probes (especially electrons) are adopted. 

The 2 recent major developments have been in the transition region and the final state interactions 
(FSI) model. In physics model development for GENIE we have been forced to pay particular 
attention to this 'transition region', as for fcw-GcV experiments it dominates the event rate. 
In particular the boundaries between regions of validity of different models need to be treated 
with care in order to avoid theoretical inconsistencies, discontinuities in generated distributions, 
and double-counting. Treatment of FSI involves many aspects of nuclear physics and strong 



interactions. Many events where particles are produced by the neutrinos have their topologies 
and kinematics altered. There are many effects to include and some dispute about the right 
techniques to use. Therefore, FSI treatment is one of the largest differences among models of the 
neutrino-nucleus interaction. 

In this brief section we will describe the models available in GENIE and the ways in which we 
combine models to cover regions of phase space where clear theoretical or empirical guidance is 
lacking. 

3.1. Nuclear Physics Model 

The importance of the nuclear model depends strongly on the kinematics of the reaction. Nuclear 
physics plays a large role in every aspect of neutrino scattering simulations at few-GeV energies 
and introduces coupling between several aspects of the simulation. The relativistic Fermi gas 
(REG) nuclear model is used for all processes. GENIE uses the version of Bodek and Ritchie 
which has been modified to incorporate short range nucleon-nucleon correlations js^]. This is 
simple, yet applicable across a broad range of target atoms and neutrino energies. The best 
tests of the REG model come from electron scattering experiments [s^. At high energies, the 
nuclear model requires broad features due to shadowing and similar effects. At the lower end 
of the GENIE energy range, the impulse approximation works very well and the REG is often 
successful. The nuclear medium gives the struck nucleon a momentum and average binding 
energy which have been determined in electron scattering experiments. Mass densities are taken 
from review articles (sij . Eor A <20, the modified Gaussian density parameterization is used. 
For heavier nuclei, the 2-parameter Woods-Saxon density function is used. Thus, the model can 
be used for all nuclei. Presently, fit parameters for selected nuclei are used with interpolations 
for other nuclei. All isotopes of a particular nucleus are assumed to have the same density. 

It is well known that scattering kinematics for nuclcons in a nuclear environment are different 
from those obtained in scattering from free nuclcons. For quasi-elastic and elastic scattering, 
Pauli blocking is applied as described in Sec. 13.21 For nuclear targets a nuclear modification 
factor is included to account for observed differences between nuclear and free nucleon structure 
functions which include shadowing, anti-shadowing, and the EMC effect [37| . 

Nuclear reinteractions of produced hadrons is simulated using a cascade Monte Carlo which will 
be described in more detail in a following section. The struck nucleus is undoubtedly left in a 
highly excited state and will typically de-excite by emitting nuclear fission fragments, nucleons, 
and photons. At present de-excitation photon emission is simulated only for oxygen (ssl-lsoj due to 
the significance of these 3-10 MeV photons in energy reconstruction at water Cherenkov detectors. 
Future versions of the generator will handle de-excitation photon emission from additional nuclear 
targets. 

3.2. Cross section model 

The cross section model provides the calculation of the differential and total cross sections. During 
event generation the total cross section is used together with the flux to determine the energies of 
interacting neutrinos. The cross sections for specific processes are then used to determine which 
interaction type occurs, and the differential distributions for that interaction model are used to 
determine the event kinematics. While the differential distributions must be calculated event- 
by-event, the total cross sections can be pre-calculated and stored for use by many jobs sharing 
the same physics models. Over this energy range neutrinos can scatter off a variety of different 
'targets' including the nucleus (via coherent scattering), individual nucleons, quarks within the 
nucleons, and atomic electrons. 

Quasi-Elastic Scattering: Quasi-clastic scattering (e.g. v^ + n ^ /i~ -\-p) is modeled using an 
implementation of the Llewellyn-Smith model (4ol |. In this model the hadronic weak current is 
expressed in terms of the most general Lorentz-invariant form factors. Two are set to zero as they 



violate G-parity. Two vector form factors can be related via CVC to electromagnetic form factors 
which are measured over a broad range of kinematics in electron elastic scattering experiments. 
Several different parametrizations of these electromagnetic form factors including Sachs (4]| . 
BBA2003 [13 and BBBA2005 ^ models are available with BBBA2005 being the default. Two 
form factors - the pseudo-scalar and axial vector, remain. The pseudo-scalar form factor is 
assumed to have the form suggested by the partially conserved axial current (PCAC) hypothesis 
[40| . which leaves the axial form factor Fyi(Q^) as the sole remaining unknown quantity. F^(0) 
is well known from measurements of neutron beta decay and the dependence of this form 
factor can only be determined in neutrino experiments and has been the focus of a large amount 
of experimental work over several decades. In GENIE a dipole form is assumed, with the axial 
vector mass m^ remaining as the sole free parameter with a default value of 0.99 GeV/c^. 

For nuclear targets, the struck a suppression factor is included from an analytic calculation of the 
rejection factor in the Fermi Gas model, based on the simple requirement that the momentum of 
the outgoing nuclcon exceed the fermi momentum kp for the nucleus in question. Typical values 
of kp are 0.221 GeV/c for nuclcons in ^^C, 0.251 GeV/c for protons in ^^Fe, and 0.256 GcV/c 
for neutrons in ^^Fe. 

Elastic Neutral Current Scattering: Elastic neutral current processes are computed accord- 
ing to the model described by Ahrens et al. [3], where the axial form factor is given by: 

The adjustable parameter rj includes possible isoscalar contributions to the axial current, and the 
GENIE default value is 77 = 0.12. For nuclear targets the same reduction factor described above 
is used. 

Baryon Resonance Production: The production of baryon resonances in neutral and charged 
current channels is included with the Rein-Sehgal model [45[. This model employs the Feynman- 
Kislinger-Ravndal (46j model of baryon resonances, which gives wavefunctions for the resonances 
as excited states of a 3-quark system in a relativistic harmonic oscillator potential with spin-flavor 
symmetry. In the Rein-Sehgal paper the hclicity amplitudes for the FKR model are computed and 
used to construct the cross sections for neutrino-production of the baryon resonances. From the 
18 resonances of the original paper we include the 16 that are listed as unambiguous at the latest 
PDG baryon tables and all resonance parameters have been updated. In our implementation of 
the Rein-Sehgal model interference between neighboring resonances has been ignored. Lepton 
mass terms are not included in the calculation of the differential cross section, but the effect of 
the lepton mass on the phase space boundaries is taken into account. For tau neutrino charged 
current interactions an overall correction factor to the total cross section is applied to account 
for neglected elements (pseudoscalar form factors and lepton mass terms) in the original model. 
The default value for the resonance axial vector mass m^ is 1.12 GcV/c^, as determined in the 
global fits carried out in Reference [i^. 

Coherent Neutrino-Nucleus Scattering: Coherent scattering results in the production of 
forward going pions in both charged current {v^ + A /i^ + 7r+ + A) and neutral current 
(f^ + A ^ Vfj^ + tt'^ + A) channels. Coherent neutrino-nucleus interactions are modeled according 
to the Rein-Sehgal model (isj . Since the coherence condition requires a small momentum transfer 
to the target nucleus, it is a low-Q^ process which is related via PCAC to the pion field. The 
Rein-Schgal model begins from the PCAC form at Q^=0, assumes a dipole dependence for non- 
zero Q^, with niA = 1.00 GeV/c^, and then calculates the relevant pion-nucleus cross section 
from measured data on total and inelastic pion scattering from protons and deuterium (49j . The 
GENIE implementation is using the modified PCAC formula described in a recent revision of the 
Rein-Sehgal model |50i that includes lepton mass terms. 

Non- Resonance Inelastic Scattering: Deep (and not-so-deep) inelastic scattering (DIS) is 
calculated in an effective leading order model using the modifications suggested by Bodek and 



Yang [37[ to describe scattering at low . In this model higher twist and target mass corrections 
are accounted for through the use of a new scaling variable and modifications to the low parton 
distributions. The cross sections are computed at a fully partonic level (the vq-^lq' cross sections 
are computed for all relevant sea and valence quarks). The longitudinal structure function is taken 
into account using the Whitlow R (i? = Fl/2xFi) parameterization [5l[. The default parameter 



values are those given in [37| , which are determined based on the GRV98 LO parton distributions 
[5^ . An overall scale factor of 1.032 is applied to the predictions of the Bodek-Yang model to 
achieve agreement with the measured value of the neutrino cross section at high energy (100 GeV). 
This factor is necessary since the Bodek-Yang model treats axial and vector form modifications 
identically and would therefore not be expected to reproduce neutrino data perfectly. This overall 
DIS scale factor needs to be recalculated when elements of the cross section model are changed. 

The same model can be extended to low energies; it is the model used for the nonrcsonant 
processes that compete with resonances in the few-GeV region. 

Quasi-Elastic Charm Production: QEL charm production is modeled according to the Ko- 
valenko local duality inspired model [H^ tuned by the GENIE authors to recent NOMAD data 



54| 



Deep-Inelastic Charm Production: DIS charm production is modeled according to the 
Aivazis, Olness and Tung model [s^. Charm-production fractions for neutrino interactions are 



taken from [56|, and utilize both Peterson [57| and CoUins-Spiller [58[ fragmentation functions, 
with Peterson fragmentation functions being the default. The charm mass is adjustable and is 
set to 1.43 GcV/c2 by default. 

Inclusive Inverse Muon Decay: Inclusive Inverse Muon Decay cross section is computed 
using an implementation of the Bardin and Dokuchaeva model [H^ taking into account all 1-loop 
radiative corrections. 

Neutrino-Electron Elastic Scattering: The cross sections for all i^e— scattering channels 
other than Inverse Muon Decay is computed according to [g^I . Inverse Tau decay is neglected. 

3.2.1. Modeling the transition region 

As discussed, for example, by Kuzmin, Lyubushkin and Naumov jHlj one typically considers the 
total vN CC scattering cross section as 

0-*°* = O-Q^^ © 0-1'' © cr^'' © ... © cr^^ © ... © fJ^^-^ 

In the absence of a model for exclusive inelastic multi-particle neutrinoproduction, the above is 
usually being approximated as 

= uQ^^ © © a^^^ 

assuming that all exclusive low multiplicity inelastic reactions proceed primarily through res- 
onance neutrinoproduction. For the sake of simplicity, small contributions to the total cross 
section in the few GeV energy range, such as coherent and elastic ve~ scattering, were omitted 
from the expression above. In this picture, one should be careful in avoiding double counting the 
low multiplicity inelastic reaction cross sections. 

In GENIE release the total cross sections is constructed along the same lines, adopting the 
procedure developed in NeuGEN [l3| to avoid double counting. The total inelastic differential 
cross section is computed as 



dQ'^dW dQ^dW dQ^dW 



The term cfia^^^ /dQ'^dW represents the contribution from all low multiplicity inelastic channels 
proceeding via resonance production. This term is computed as 
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where the index k runs over all baryon resonances taken into account, Wcut is a configurable 
parameter and {d'^a^^ / dQ'^dW)k is the Rein-Seghal model prediction for the fc*'' -"t"^""""" 
section. 



resonance cross 



The DIS term of the inelastic differential cross section is expressed in terms of the differential 
cross section predicted by the Bodek-Yang model appropriately modulated in the "resonance- 
dominance" region W < Wat so that the RES /DIS mixture in this region agrees with inclusive 



cross section data [63, |63|, |6J, [65 
13, [Zi, [zi, m [U and 2-pion 
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82 



6^, M, M, S Izii and exclusive I-pion jzl, M, [3, M, M, 
761] cross section data: 



f}2 DIS M DIS,BY 



In the above expression, m refers to the multiplicity of the hadronic system and, therefore, the 
factor /m relates the total calculated DIS cross section to the DIS contribution to this particular 
multiplicity channel. These factors are computed as /,„ = Rm-P^^f''^ where R„i is a tunable 
parameter and P^"''^ is the probability, taken from the hadronization model, that the DIS final 
state hadronic system multiplicity would be equal to m. The approach described above couples 
the GENIE cross section and hadronic multiparticle production model [s^. 



3.3. Neutrino-induced hadronic multiparticle production modeling 

Neutrino-induced hadronic shower modeling is an important aspect of the intermediate energy 
neutrino experiment simulations, as non-resonant inelastic scattering becomes the dominant in- 
teraction channel for neutrino energies as low as 1.5 GeV. 

Experiments are sensitive to the details of hadronic system modeling in many different ways. 
Experiments making calorimctric measurements of neutrino energy in charged current reactions 
are typically calibrated using single particle test beams, which introduces a model dependence 
in determining the conversion between detector activity and the energy of neutrino-produced 
hadronic systems. Physics analysis can also depend on the prediction of the hadron shower 
characteristics, such as shower shapes, energy profile and particle content, primarily for event 
identification. A characteristic example is a —^ v^, appearance analysis, where the evaluation 
of backgrounds coming from neutral current (NC) events, would be quite sensitive on the details 
of the NC shower simulation and specifically the tt" shower content. It is therefore imperative 
that the state-of-the-art in shower modeling is included in our neutrino interaction simulations. 

GENIE uses the AGKY hadronization model which was developed for the MINOS experiment 
[s^. This model integrates an empirical low- invariant mass model with PYTIIIA-6 at higher 
invariant masses. The transition between these two models takes place over an adjustable window 
with the default values of 2.3 GeV/c^ to 3.0 GeV/c^, so as to ensure continuity of all simulated 
observables as a function of the invariant mass. For the hadronization of low-mass states the 
model proceeds in two phases, first determining the particle content of the hadronic shower, and 
secondly determining the 4-momenta of the produced particles in the hadronic center of mass. 



The AGKY's low mass hadronization model generates hadronic systems that typically consist of 
exactly one baryon (p or n) and any number of tt"*" , 7r~ , 7r° , , K~ , , K*^ mesons kinematically 
possible and allowed by charge conservation. 

For a fixed hadronic invariant mass and initial state (neutrino and hit nuclcon), the algorithm 
for generating the hadron shower particles generally proceeds as follows: 



Compute the average charged hadron multiplicity < rich > for a given invariant mass (W) 
using empirical expressions of the (< rich >= o,ch + bch * InW^) form. The coefficients ach, 
hch, which depend on the initial state, have been determined by bubble chamber experiments 

Compute the average total hadron multiplicity < ritot > as < ritot >~ 1-5 < rich >■ 

Using the calculated average hadron multiplicity, generate the actual hadron multiplicity 
taking into account that the multiplicity dispersion is described by the KNO scaling law, 
(< n > P{n) = f{n/ < n >)) [sl]. P(n) is the probability of having n hadrons in the final 
state given an expected average of < n >, and /(n/ < ?i >) is a universal scaling function. 
The KNO scaling is parametrized employing the Levy function with an input parameter 
Cch that depends on the initial state and is treated as a tuning parameter. 

Generate hadrons up to the generated hadron multiplicity taking into account the hadron 
shower charge conservation and the kinematical constraints. Protons and neutrons are 
produced in the ratio 2:1 for vp interactions, 1:1 for un and Dp, and 1:2 for Dn interactions. 
Charged mesons are then created in order to balance charge, and the remaining mesons 
are generated in neutral pairs. The probabilities for each are 31.33% (7r°,7r°), 62.67% 
(tt+jTt") and 6% of strange meson pairs. The probability of producing a strange baryon 
associated production is determined from a fit to A production data 8^, 87 , H, [8§| : 



via 



^hyperon — ^hyperon ^hyperon In (2) 

Fig. [1] shows the data/model comparisons of the average charged hadron multiplicity < rich > 
as a function of the squared hadronic invariant mass for vp and vn interactions. Fig. [2] shows 
the data/model comparisons of the negatively charged hadron multiplicity dispersion D_ as a 
function of the average charged hadron multiplicity < n_ > and the reduced dispersion -D-/ < 
?7_ > as a function of the squared hadronic invariant mass. 

The main dynamical feature observed in the study of hadronic systems is that the baryon tends 
to go backwards in the hadronic center of mass and that the produced hadrons have small trans- 
verse momentum relative to the direction of momentum transfer. These features are naturally 
described in the quark model where the baryon is formed from the diquark remnant, which goes 
backwards in the ccnter-of-mass, and transverse momentum is generated solely through intrinsic 
parton motion and gluon radiation. At low invariant masses energy-momentum constraints on the 
available phase space play a particularly important role. The most pronounced kinematical fea- 
ture in this region is that one of the produced particles (proton or neutron) is much heavier than 
the rest (pion and kaons) and exhibits a strong directional anticorrelation with the momentum 
transfer. 

Our strategy, therefore, is to correctly reproduce the final state nucleon momentum, and then 
perform a phase space decay on the remnant system employing, in addition, a pT-based rejection 
scheme designed to reproduce the expected meson transverse momentum distribution. The nu- 
cleon momentum is generated using input p^ and xp ~ {P*L/P*Lmax) PDFs which are parametrized 



based on experimental data [90|, |9lJ . Once the baryon momentum is selected the remaining par- 



ticles undergo a phase space decay. The phase space decay employs a rejection method suggested 



^The Levy function Levy{z; c) = 2e-'=c'=^+Vr(c2 -I- 1) 



in [94I, with a rejection factor e~^*P'^ for each meson. This causes the transverse momentum 
distribution of the generated mesons to fall exponentially with increasing p^, controlled by the 
adjustable parameter A which has a default value of 3.5 GeV~^. Here pt is the momentum com- 
ponent perpendicular to the current direction. One of the remaining challenges in this model, 
which will be addressed in the future, is to better describe forward and backward hemisphere 
multiplicity distributions. The forward/backward multiplicity distributions yield an unphysically 
rapid transition to the PYTHIA values, a feature not seen in other recent hadronization models 

Fig. [3] shows the data/model comparisons of the fragmentation function for positively and neg- 
atively charged hadrons. 2-body hadronic systems are a special case: The hadronic system 
4-momenta are generated by a simple unweighted phase space. 
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Figure 2: Data/model comparisons of the negatively 
charged hadron multiphcity dispersion D_ as a func- 
Figure 1: Data/model comparisons of the average tion of the average charged hadron multiplicity < 
charged hadron multiplicity < n^ji > shown as a func- n_ > (up) and the reduced dispersion -D-/ < n_ > 
tion of the squared hadronic invariant mass for and as a function of the squared hadronic invariant mass 
vn interactions. Data arc from Refs. l84l |94|| . (down). Data are from Ref. [s^l . 



3.3.1. Intranuclear rescattering 

The hadronization model describes particle production from free targets and is tuned primarily to 
bubble chamber data on hydrogen and deuterium targets [84, 90, 94, 96, 97, 95, 98, 99]. Hadrons 
produced in the nuclear environment may rescatter on their way out of the nucleus, and these 
reinteractions significantly modify the observable distributions in most detectors. 

It is also well established that hadrons produced in the nuclear environment do not immediately 
reinteract with their full cross section. The basic picture is that during the time it takes for 
quarks to materialize as hadrons, they propagate through the nucleus with a dramatically reduced 




Figure 3: Data/model comparisons of the fragmentation function for positively and negatively charged hadrons. 
Applied cuts: Squared hadronic invariant mass above 5 GeV^/c* and squared 4-momentum transfer above 
1 Gey2/c2. Data arc from Rcf. 



interaction probability. This was implemented in GENIE as a simple 'free step' at the start of 
the intranuclear cascade during which no interactions can occur. The 'free step' comes from a 
formation time of 0.523 fm/c according to the SKAT model |lOOl |. 

Intranuclear hadron transport in GENIE is handled by a subpackage called INTRANUKE . IN 



TRANUKE is an intranuclear cascade simulation and has gone through numerou s rev isions 101 1 
since the original version was developed for use by the Soudan 2 Collaboration [l02| . The sen- 
sitivity of a particular experiment to intranuclear rescattering depends strongly on the detector 
technology, the energy range of the neutrinos, and the physics measurement being made. IN- 
TRANUKE simulates rescattering of pions and nucleons in the nucleus. 

In principle one would like to have a fully realistic nuclear model which accurately describes the 
full range of processes to model particle production with energies as low as 1 MeV to ensure that 
the simulations are suitable for any conceivable experiment. Nuclear simulations of this type are 
themselves highly complex packages and the integration of these packages with GENIE is an area 
of active work. An alternative approach is to develop a simpler nuclear model, in the context of 
a particular experiment, and ensure that the relevant physics for that experiment are correctly 
described. This approach has the advantage of yielding a far simpler code, which is understood 
by the experimenters. This has particular advantages for the study of systematic errors and the 
development of ancillary code like reweighting packages. 

The current version was optimized for use by the MINOS experiment. For this experiment 
the task of developing an intranuclear rescattering model is simplified because the detector is 
composed largely of a single element, iron, and the detector is designed to make a calorimctric 
energy measurement rather than track individual particles. For the oscillation measurement of 
MINOS the primary goal is ensuring that the 'missing energy' lost in the nuclear environment 
is being reliably simulated. The model has applicability to almost all nuclei and a wide range of 
energies. 

To handle a wide energy range for neutrinos, GENIE has defined processes for hadrons up to 
1.8 GeV kinetic energy in terms of all the relevant cross sections. For higher hadron energy, 
the underlying cross sections are assumed to be constant at the 1.8 GeV value. This is a good 
approximation to the actual values. The code then has a description of all hadrons coming from 
neutrinos at all relevant energies. 

The simulation tracks particles through the nucleus in steps of 0.05 fm. For each particle only 
one reinteraction is allowed, and the simulation consists of the following steps: 



1. Mean Free Path: In order to determine if the particle interacts in a particular step the 
mean free path (A) is calculated based on the local density of nucleons (p(r))_[36| and a 
partial wave analysis of the large body of hadron-nucleon cross sections {auN) ^103.] : 



(^hN[Eh)p{r) 

. Here, afiN is the isospin averaged cross section for the propagating hadron (with energy 
Efi) interacting with a nucleon and p{r) is the matter density of nucleons at the position of 
the propagating hadron. We use charge densities which are well-measured and arc known 
to be very similar to the matter densities. The code presently tracks pions and nucleons. 
Isospin relations are used to estimate tt*^— nucleon reactions. All nuclei heavier than oxygen 
are modeled with a Woods-Saxon density distribution and lighter nuclei are modeled with 
a modified Gaussian distribution. 

One difficulty in this approach is that our treatment is using a semiclassical model to de- 
scribe a quantum mechanical process. This poses particular difficulty in describing elastic 
scattering which dominates the total cross section at low energy. This wave/particle distinc- 
tion depends on energy with lower energy hadron-nucleus scattering being more wave-like. 
To account for this we increase the size of the nuclear density distribution through which 
the particle is tracked by an amount 

f'A (4, 

where / is an adjustable dimcnsionlcss parameter set to 0.5 for pions and 1.1 for nucleons in 
the current default. We use the isospin-averaged total cross sections for pions and nucleons. 
Determining the Interaction Type: Hadron-nucleus interactions occur with different 
processes and each has an associated cross section - a^ias for elastic scattering (residual nu- 
cleus in the ground state), ainei for inelastic scattering (residual nucleus in excited state, typ- 
ical response is single nucleon emission), acex for single charge exchange (outgoing hadron 
changes charge, typical response is single nucleon emission) for all hadrons. For pions, 
emission of 2 or more nucleons with no pion in the final state is called absorption - aabs ] for 
nucleons, a final state with 3 or more nucleons is called multi-nucleon knockout - ako- For 
low energy nucleons, the knockout mechanism dominates. At higher energies (above about 
400 MeV for pions and 600 MeV for nucleons), the probability of pion production {aptprod) 
becomes important. The total cross section {atot) is the sum of all component cross sections 
and the reaction cross section (areac) is the sum of cross section for all inelastic reactions, 

^reac — ^cex ^" ^inel ^" ^abs ^" ^piprod — ^tot O'elas- (5) 

This equation is specific to pions; aabs is replaced by ako for nucleons. Once it has been 
determined that a hadron reinteracts in the nucleus, the type of the interaction is determined 
based on the measured cross sections for the above listed processes. Cross sections for kinetic 
energy less than 1 GeV are used and they are assumed constant above 1 GeV. Where dat a 
is sparse, cross section estimates are taken from calculations of the CEM03 group |l04| . 
Since only 1 reinteraction is allowed, the effect of additional reactions with the rest of the 
nucleus must be included here. The distribution of final states is optimized for iron, but 
valid for all targets. Figure [4] shows the default INTRANUKE compared with 7r+ data for 
atot and areac for iron and carbon. These are examples of many successful comparisons. 
Final State Products: Once the interaction type has been determined, the four-vectors of 
final state particles need to be generated. Where possible these distributions are parametrized 
from data or from the output of more sophisticated nuclear models |l05l |. Simple models 
are used for elastic scattering angular distributions. The quasielastic reaction mechanism is 
known to dominate the final state for inelastic or charge exchange processes. Fermi motion 
and binding energy are used to get a good description of the kinematics. Whenever 3 or 
more particles are emitted, distributions are according to phase space. For MINOS, the 
most important issue is missing energy generated by inelastic and absorption processes. 



Very low energy hadrons and nuclear recoils are not seen, so simplifications can be made. 
All states where more than 5 nucleons are emitted are treated as though 5 nucleons (3 
protons and 2 neutrons) were emitted. These restrictions are relaxed in the most recent 
code versions to better match what is seen in data. 
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7r+ total and reaction cross sections for iron (left) and carbon (right). Data are from Refs. llOa . llO 



3.^. Physics Model Tuning 

The full range of models involve more than one hundre d ad justable parameters, the complete 
set of which arc given in the Physics and User Manual [llOj . Only the most important in the 
construction of the physics models will be discussed here. Electroweak parameters and CKM 
matrix elements use the values of the Particle Data Group [i^ . 

As mentioned previously, the quasi-elastic, resonance production, and DIS models employ form 
factors, axial vector masses, and other parameters which have been determined by others in 
their global fits 43|, |47|. In order to check the overall consistency of our model, and to ver- 
ify that we have correctl y impleme nted the DIS model, predictions are co mpa red to electron 
scattering inclusive data 111 . Ill2| and neutrino structure function data |ll3| . The current 
default values for transition region parameters are WcMt=l-7 GcV/c^, R2{vp) — R2{T'n)—0.1, 
R2(vn) ~ R2{T'p)=0.3, and Rm~1.0 for all m > 2 reactions. These are determined from fits to 
inclusive and exclusive (one and two-pion) production neutrino int eract ion channels. For these 
comp arisons we rely heavily on online compilations of neutrino data (ll4l | and related fitting tools 
[ll5| that allow one to include some correlated systematic errors (such as arising from flux uncer- 
tainties). The GENIE default cross section for i/fj, charged current scattering from an isoscalar 
target, together with the estimated uncertainty on the total cross section, as evaluated in |ll6j 
are shown in Fig. [5l 

The tuning of the had ronization model is accomplished using data from the BEBC [o^] , FNAL 
117 1. and SKAT |ll8t bubble chamber experiments, and is described in more detail elsewhere 



119|. Multiplicity measurements include averaged charged and neutral particle (tt^) multiplici- 
ties, forward and backward hemisphere average multiplicities and correlations, topological cross 
sections of charged particles, and neutral - charged pion multiplicity correlations. Hadronic system 
measurements include fragmentation functions (z distributions), xp distributions , (transverse 
momentum squared) distributions, and xp — (Pt) correlations ("seagull" plots) [120[. Averaged 
charged particle mu ltipli city and dispersion parameters are taken from published values (s^ l , as 
well as our own fits IllQl l . Baryon 4-momentum distributions are determined from fits to experi- 



mental data l9l|. The settings for PYTHIA parameters are taken as the n on-d efault values 
tuned for the NUX [18| generator, a high energy generator used by the NOMAD 121 1 experiment. 



The intranuclear rescattering model has been tested and tuned based on comparisons to hadron- 
nucleus data. Hadron-nucleus cross sections are calculated by 'MC experiments' where a nucleus 




10 

E(GeV) 



Figure 5: GENIE default cross section for ufj, charged current scattering from an isoscalar targ et. The shaded 
band indicates the estimated uncertainty on the free nucleon cross section. Data are from |62| (CCFRR), 
(CDHSW), m (GGM-SPS), [H,!!! (BEBC), [H (ITEP), [H (CRS, SKAT), [H (ANL), [73 (BNL) and 
(GGM-PS) 



is being illuminated by a uniform hadron beam with transverse radius larger than the nucleus 
size. Figure |4] shows the comparison between INTRANUKE and data for Tr"*"— C and tt"*"— Fe 
total and reaction cross sections. Extrap olatio ns to higher energy are required in many cases as 
data for only areac is available. CEM03 |l05l | results with appropriate rescaling to match data 
at lower energies are often used. Although the model is tuned to hadron scattering on iron, the 
simplicity of the Fermi Gas model and the A^^/"^^ scaling of the cross sections allow the model to 
be applied to nearly all nuclei encountered in the simulation as well. 

Validation of the intranuclear rescatt ering model using neutrino data has also been performed 
This re 
terium 



This revisits the analysis of Reference [102 1 wh ich compares ANL neutrino scattering data on deu- 
to BEBC neutrino- neon data 122| . where each have been rescaled to an atmospheric 
neutrino flux. By comparing neon and deuterium final state topology fractions the rates for pion 
absorption and charge exchange in neutrino-neon interactions can be determined. INTRANUKE 
reproduces the measured final state topology fractions with an overall of 16.0 for 12 degrees 
of freedom. The rates of pion absorption and charge exchange produced by INTRANUKE in this 
comparison are 18.3 ± 0.5% and 2.9 ± 0.2% respectively, in good agreement with the measured 
values of 22 ± 5% and 10 ± 8%. 



In evaluating the uncertainty in the intranuclear rescattering model, several sources of uncertainty 



were taken into account for their effect on the MINOS determination of the hadronic energy scale 
[l23j . These include the experimental uncertainty on the external data that serves as the input 
to the model, as well as on some of the key theoretical assumptions in the model, in parti cula r 
the modeling of pion absorption reactions and the treatment of low energy pion scattering |l23l | . 



4. Recent Developments 

Recent focus on development for GENIE has been on the nuclear structure and final state interac- 
tion codes. The purpose is to make the code well-tuned to the needs of the upcomin g ge neration 
of accelerator experiments including T2K , NoVA Q , Minerva P , MicroBooNE and Ar- 



goNEUT We are also looking to push the validity range of GENIE down to t he M eV scale 



making it applicable for the study of neutrinos from reactors, supernova and SNS [12 

The Fermi Gas nuclear model h as b een shown to be wrong in detail through interpretation of 
electron scattering experiments |l25j . Nucleon-nuclcon correlations are important in kinem atic 
regions where the impulse approximation is unlikely to apply. The spectral function 126l . 127 1 has 



become a useful model to represent the effects of a many-body model. Developmental versions of 
GENIE now contain spectral functions of Benhar for carbon, iron, and lead. Code for calculating 
(e, e') differential cross sections is now in place. 



A true intcrnuclcar cascade model has also been in development |10l| . It tracks pions and 
nucleons through multiple reactions in the same nucleus in which the neutrino was absorbed. 
Free hadron-nucleon cross sections are used with the struck nucleon has momentum and binding 
energy according to the Fermi Gas model. Interactions for protons, neutrons, and pions are 
presently modeled. The hadrons are on-shell between scatterings; the reactions are governed by 
the same mean free path and Fermi gas models as for the existing model. Thus, there is no limit 
on the number of particles that can be tracked. A simple model for compound nuclear processes 
is included to properly account for effects in hadron-nuclcus data. 



5. Software Design Overview 

In this section we will describe the software design of the GENIE package. We begin by discussing 
the software requirements and use cases. The software framework is presented together with key 
classes such as particles, events, and interactions. The hierarchical delegation of responsibility 
during event generation to driver, thread, and module classes is described. 



5.1. Requirements 

The process of requirements capture and software design was carried out over a period of several 
months in 2004 and 2005. During this phase there was extensive discussion within the MINOS 
collaboration as well as in conjunction with the NuINllf] Conference series. Through these meet- 
ings we received input from many users of these packages as well as with several experts who had 
designed, developed, and supported previous (Fortran-based) procedural codes. 

Through these discussions the need for a package of this type for future neutrino experiments 
became very clear. The procedural programs had often been in use for several decades and had 
expanded greatly beyond their initial scope. They had often reached a critical mass where further 
modifications were deemed extremely difficult because of the overall fragility of the architecture. 
In addition there were often strong couplings between aspects of the simulation that made incre- 
mental improvements in a particular area difficult. Another commonly voiced concern was the 



^Neutrino Interactions in the Few-GeV Energy Range 



lack of documentation, particularly regarding the ways in which the models and their parameters 
were tuned to data. 

These discussions served to illuminate several typical use cases for neutrino event generators and 
related tools: 

• For event generation in conjunction with a full detector simulation. 

• For event generation in fast simulations, either 4- vector only or using a parametrized model 
of the detector response. 

• To provide a library of cross section values for interaction rate calculations. 

• As a source of information about the underlying models and their uncertainties. 

• As a primary tool in the evaluation of systematic uncertainties. 

These use case evaluations and discussions led to the establishment of a set of requirements for 
the overall architecture as well as requirements for the documentation and ancillary tools: 

• Decouple physics models as much as possible from the framework code. 

• Lower the barrier to entry for physics model developers (particularly theorists). 

• Provide a well-tested, well-defined default configuration which provides a benchmark for all 
users who are primarily interested in using the package in black box-mode. 

• Incorporate up-to-date theoretical and experimental work and provide a flexible framework 
so that it can be maintained. 

• Incorporate state-of-the-art software engineering methodologies to support these goals through 
an object-oriented design. 

• Leverage the developments in other areas of HEP software development (in particular 
ROOT class libraries). 

• Provide the external data used to tune the package as part of the overall distribution. 

• Provide clear documentation about how the models are tested, tuned, and validated. 

• Provide a set of tools to facilitate the tuning of model parameters and for an independent 
evaluation of how well models describe existing data. 

These requirements were met in the August 2007 first release of the GENIE package. The 
impl ementation of the physics models was cross-checked through an exhaustive set of comparisons 
[l28t with one of the existing procedural codes [3]. GENIE can be configured to be identical 
in its physics content to the final version (v3.5.5) of the NEUGEN3 package, one of the legacy 
procedural codes which had been used by numerous experiments for over more than a decade. 

5.2. Core Framework 

The key requirement of the GENIE Core Framework is to transparently decouple the high-level 
code focusing on physics simulations from the low-level structures involved primarily with memory 
management and configuration. 

The framework was developed and reviewed primarily within the MIN OS experiment and, in- 
evitably, has been influenced by the MINOS offline software design [l29l |. In developing the GE- 
NIE framework we recycled, adapted and extended key features of the MINOS offline framework. 



We drew heavily from the accumulated software engineering experience encapsulated within pop- 
ular software design patterns, including the Visitor, Chain of Responsibility, Factory, Strategy 
and Singleton [130|. The Core Framework is not specific to the subject matter domain of GENIE 
and could be adapted and reused in other scientific computing applications. The GENIE Event 
Generation Framework, to be discussed later, is a subject matter-specific layer built on top of 
the Core Framework. 

The framework concerns itself mainly with the properties, instantiation and memory manage- 
ment of software abstractions called Algorithms, and specifies the interfaces that underpin the 
interactions between the numerous concrete Algorithm realizations. The Algorithm is a key Core 
Framework abstraction. The notion of an 'algorithm' in an object-oriented system requires fur- 
ther clarification as it does not correspond to its more familiar notion in the context of procedural 
software systems. In the Core Framework the Algorithm encapsulates the common behavior of 
all algorithmic objects. It is an abstract base class which defines exactly how algorithmic objects 
are to be initialized and configured, how they are to look up their configuration, how they are 
to be identified, and how they report their status. These arc common, largely operational fea- 
tures that characterize a very heterogeneous collection of algorithms such as cross section models, 
hadronization models, particle decayers, form factor and structure function models, event gener- 
ation modules and threads and other types of algorithms that can be found within GENIE. The 
fact that such a comm on b ehavior is imposed upon all algorithmic objects allows us to build a 
central, external, XML |l3l| - based algorithm configuration system that contributes significantly 
to the fiexibility and extensibility of GENIE. The kind of computation to be performed, the usual 
identifying feature of an algorithm in a procedural system, is a secondary characteristic at this 
level of abstraction. 

At the next level up from the Algorithm root of an algorithm inheritance tree, we find a stan- 
dardized interface which defines how to invoke each specialized type of calculation and retrieve its 
results. Numerous such specialized algorithm interfaces exist within GENIE. Examples include 
the GFluxI interface for concrete flux drivers, the XSecAlgorithml interface for concrete cross 
section models, and the EventRecordVisitorl interface for for concrete event generation steps. 
Invoking all algorithms through such standardized interfaces guarantees scalability and ensures 
the seamless integration of new concrete implementations. 

The algorithmic objects are stateless and their behavior is fully externally configured. The al- 
gorithm configurations arc stored in XML files. Typically, there is a single XML configuration 
file per algorithm. Each file may contain multiple configuration sets for that algorithm with each 
configuration set being uniquely identified by a name. The algorithm configuration variables 
can be of many different types (including booleans, integers, real numbers, strings, ROOT 1-D 
or 2-D histograms, ROOT n-tuples/trees or other GENIE algorithms with their own configu- 
rations). Each configuration variable, in a given set, is uniquely identified by a name. During 
the initialization phase, all XML configuration files are parsed and each named configuration set 
is stored at a type-safe 'value' — > 'type' associative container called the Registry . All Registry 
objects instantiated in initialization phase are stored in a shared pool called the AlgConfigPool. A 
unique name is being used to identify each Registry in that pool. The name is constructed by the 
name of the configuration set, the name of the algorithm the configuration is intended for and the 
namespace that the algorithm lives in, as 'namespacc::algorithm-name/configuration-name'. At 
run-time each algorithmic object can look up its configuration set by accessing the corresponding 
Registry object. 

One feature of the GENIE configuration system is especially worth noting. Algorithm config- 
uration sets may include other algorithms (with their own configurations, which in turn may 
contain more algorithms). GENIE 's extensibility and flexibility is largely due to this feature in 
conjunction with the standardization of the algorithm interfaces. In the actual GENIE code one 
only needs deflne a call sequence between abstract algorithm-types such as, for example, that an 
algorithm-type specialized in generating scattering kinematics, invokes another algorithm-type 
specialized in cross section calculations which, in its turn, should invoke another algorithm type 



specialized in form factor calculations. Once that call sequence has been defined in the code, 
many concrete realizations may come into being purely at the configuration level by specifying 
the names of the concrete algorithms and the names of their configuration sets. 

Typi cally , pro-configured instances of GENIE algorithms are accessed through an algorithm fac- 
tory [130l | which is responsible for instantiating each algorithm (upon request) and allowing it 
to look up its configuration. The factory typically owns and manages the list of all instantiated 
concrete algorithms. Since algorithms are stateless objects, further requests for an instantiated 
concrete algorithm results in the previously instantiated algorithm being returned rather than a 
new one being created. 

By default all instanti ated concrete algorithms and configurations are stored within shared pools 
designed as singletons |l30l |. As these are shared pools, modifications have global effects. For ex- 
ample, modifying a low-level algorithm configuration modifies all call sequences that include that 
algorithm. This is desirable in most contexts, such as for example for the consistent propagation 
of physics parameter changes throughout GENIE. There arc certain situations, however, such 
as fitting or event reweighting applications, where this may be not be a desirable feature. The 
GENIE Core Framework allows algorithms to clone and assume ownership of the entire sequence 
of sub-algorithms they depend upon, along with each sub-algorithm's configuration registries. 
That cloned call-sequence of algorithmic objects is stored in a local rather than a shared pool. In 
this way, concrete top-level algorithms behave as self-contained capsules and can be re-configured 
in isolation without affecting other GENIE components. 

5.3. Event generation framework: Particles, Events and Interactions 

In this section three key framework classes, the GHepParticle, GHepRecord, and the Interaction 
classes, are described. 

GENIE is using the natural system of units (?i = c = 1) so almost every simulated quantity is 
expressed in powers of [GeV] . Exceptions are the event vertex in the detector coordinate system 
(in SI units) and particle positions in the hit nucleus coordinate system (in fm). Different units 
may be employed when native GENIE event descriptions are converted to experiment-specific 
formats in accordance with the format specification. 

5.3.1. Particles 

The basic output unit of the event generation process is a 'particle'. This is a term used to 
describe both particles and nuclei appearing in the initial, intermediate or final state, as well as 
generator-specific pseudo-particles used for facilitating book-keeping of the generator actions. 

Each such 'particle' generated by GENIE is an instance of the GHepParticle class. These objects 
contain information with particle-scope including: particle ID and status codes, PDG mass, 
charge, name, indices of mother and daughter particles marking possible associations with other 
particles in the same event, 4- momentum, 4-position in the target nucleus coordinate system, 
polarization vector, and other properties. The GHepParticle class includes methods for setting 
and querying these properties. 

GENIE has adopted the standard PDG particle codes [i^. For ions it has adopted a PDG 
extension, using the 10-digit code lOLZZZAAAI where L is the number of strange quarks ZZZ is 
the total charge, AAA is the total baryon number, and I is the isomer number (1=0 corresponds 
to the ground state). GENIE-specific pseudo-particles have PDG code >= 2000000000 and can 
convey important information about the underlying physics model. Pseudo-particles generated 
by other specialized programs that may be called by GENIE (such as PYTIIIA-6) are allowed to 
retain the codes specified by that program. 

GENIE obtains particle data (including particle names, codes, masses, widths, decay channels and 
more) using the ROOT's TDatabasePDG. This particle data-base manager object is initiahzed 



with the constants used in PYTHIA-6. The data-base has been augmented by the GENIE authors 
to in clude baryon resonances, nuclei and GENIE-specific pseudo-particles. Details are given in 
Ref. [not . 

GENIE marks each particle with a status code. This signifies the position of a particle in a time- 
ordering of the event and helps navigation within the event record. Most generated particles are 
marked as one of the following: 

• 'initial state', typically the first two particles of the event record corresponding to the 
incoming neutrino and the nuclear target. 

• 'nuclcon target', corresponding to the hit nuclcon (if any) within the nuclear target. 

• 'intermediate state', typically referring to the remnant nucleus, fragmentation intermediates 
such as quarks, diquarks, or intermediate pseudo-particles. 

• 'hadron in the nucleus', referring to a particle of the primary hadronic system, defined as the 
particles emerging from the primary interaction vertex before any possible re-interactions 
in the nucleus. 

• 'decayed state', such as for example unstable particles that have been decayed. 

• 'stable final state' for the relatively long-lived particles emerging from the nuclear targets. 

All particles generated by GENIE during the simulation of a single neutrino interaction are stored 
in a dynamic container representing an 'event'. 

5.3.2. Events 

Events generated by GENIE are stored in a custom, STDHEP-like event record called a GHEP 
record. Each GHEP event record, an instance of the GHepRecord class, is a ROOT TCloncsArray 
container of GHepParticle objects representing individual particles. 

Other than being a container for the generated particles, the event record holds additional in- 
formation with event-, rather than particle-, scope such as the cross sections for the selected 
event, the differential cross section for the selected event kinematics, the event weight, a series of 
customizable event flags, and interaction summary information (described in the next section). 

Additionally, the event record includes a host of methods for querying / setting event properties 
including many methods that allow querying for specific particles within the event. Examples 
include methods to return the target nucleus, the final state primary lepton, or a list of all stable 
descendants of any intermediate particle. 

The event record features a 'spontaneous re- arrangement' feature which maintains the compact- 
ness of the daughter lists at any given time. This is necessary for the correct interpretation of the 
stored particle associations as the daughter indices correspond to a contiguous range. The par- 
ticle mother and daughter indices for all particles in the event record are automatically updated 
as a result of any such spontaneous particle rearrangement. 

The event generation itself is built around the GHEP event record using the Visitor design pattern 
[l30| |. The interaction between the GHEP event record and the event generation code will be 
outlined in the following sections. 

The GHEP structure is highly compatible with the event structures used in most HEP generators. 
That allows us to call other generators, such as PYTIIIA-6, as part of an event generation chain 
and convert / append their output into the current GHEP event. Additionally the GHEP events 
can be converted to many other formats for facilitating the GENIE interface with experiment- 
specific offline software systems and cross-generator comparisons. 



5.3.3. Interactions 



The GHEP record represents the most complete description of a generated event. Certain external 
heavy-weight applications such as specialized event reweighting schemes or realistic detector 
simulation chains using the generator as the physics front-end require all of the detailed particle- 
level information. However, many of the actual physics models employed by the generator, such 
as cross section, form factor, or structure function models, require a much smaller subset of 
information about the event. 

An event description based on simple summary information, typically including a description 
of the initial state, the process type and the scattering kinematics, is sufficient for driving the 
algorithmic objects implementing these physics models. In the interest of decoupling the physics 
models from event generation and the particle-level event description, GENIE uses an Interaction 
object to store summary event information. Whenever possible, algorithmic objects implementing 
physics models accept a single Interaction object as their sole source of information about an 
event. This enables the use of these models both within the event generation framework but also 
within a host of external applications such as model validation frameworks, event re- weighting 
tools and user physics analysis code. 

An Interaction object is an aggregate, hierarchical structure, containing many specialized objects 
holding information for the initial state [InitialState object), the event kinematics {Kinematics 
object), the process type (Processlnfo object) and potential additional information for tagging 
exclusive channels (XclsTag object). Instantiating Interaction objects for driving physics models 
is streamlined using the 'named constructor' C++ idiom. They can be serialized into unique 
string codes which, within the GENIE framework, play the role of the 'reaction codes' of the 
old procedural systems. These string codes are used extensively for mapping information to 
interaction types. Two examples include mapping interaction types to pre-computed cross section 
splines or mapping interaction types to specialized event generation code. Each generated event 
has an Interaction summary object already attached to it. 

5.4- Event generation processing units: Modules, Threads and Drivers 

On an operational level the responsibility for generating events is shared between event generation 
drivers, threads and modules. Tasks are delegated from event generation drivers to threads, and 
from threads to modules. Event generation drivers can include multiple threads, and threads can 
include multiple modules. Event generation drivers are responsible for generating events for a 
particular user-defined situation. These can be as simple as monoenergetic neutrinos interacting 
off a single target, to complex situations involving the output of realistic beam- line simulations 
and full detector geometry descriptions. Threads are responsible for generating the physics of 
particular classes of events, for instance charged-currcnt quasielastic. Modules carry out a single 
step in that generation process. 

5.4.1. Event generation modules 

An event generation module is a key event generation abstraction. Each event generation module 
encapsulates a well defined event generator operation which, in physics terms, can be any of a 
very diverse set of actions. Examples include selecting the scattering kinematics, generating the 
final state primary lepton or the primary hadronic system, transporting hadrons within the target 
nucleus, and decaying unstable particles. 

Operationally, event generation can be seen as a series of well-defined processing steps operating 
on the GHEP event record. The act of operating on the event record defines an interface that 
is encapsulated by the EventRecordVisitorl abstract class. As it is indicated by the interface 
name, the Visitor design pattern is being employed [l3C| . Concrete event generation modules, 
implementing the EventRecordVisitorl inteviacc, 'visit' the event record. The event record then 
invokes each attached module and is modified as a result. 



Due to the diversity of the event processing operations that must be considered by GENIE, 
we formed the event generation module abstraction focusing on the common operational aspect 
of (potentially) modifying the event record. This represents the most generic way of thinking 
about event generation and guarantees that any future physics addition, especially ones not 
envisioned at this stage of the generation evolution, can be trivially embedded into the existing 
framework. Treating the event generator modules uniformly and standardizing on the event 
generation module interface allows us to build a flexible and extensible system where modules 
can be dynamically plugged in/out of the event generation or interchanged. Examples can further 
clarify the utility of this abstraction: a module handling a set of particle decays can be unplugged 
to inhibit those decays, or a module handling intra-nuclear hadron transport may be swapped 
with another module performing the same operation using a different physics strategy. 

Whenever possible event generation modules are written in a generic way, containing code im- 
plementing just the neutrino event generation mechanics. The actual physics model itself is 
specified in the generation module configuration. This decoupling of mechanics from models 
greatly simplifies code development, transparency, and physics validation, simplifying the overall 
structure and reducing the amount of code that needs to be actively developed and scrutinized 
between successive releases. An example will clarify this factorization: The module selecting 
the kinematics for deep-inelastic neutrino-nucleon interactions does not contain the actual code 
for the deep-inelastic differential cross section. It merely contains code to calculate the allowed 
kinematical phase space for the process, select a point in that phase space using a Monte Carlo 
acceptance / rejection method, and update the GHEP record accordingly. The actual differential 
cross section model used during the Monte Carlo selection is an external physics model invoked 
by the event generation module. The module itself can be recycled many times by instructing it 
to call a different cross section model each time. As a result of that factorization, multiple call 
sequences can be defined purely at the configuration level without code duplication. 

5.4-2. Event generation threads 

An event generation thread is an ordered sequence of processing steps, encapsulated by event 
generation modules, that can be applied to an empty GHEP event record to completely gener- 
ate some class of physics events. This process defines an interface that is encapsulated by the 
EventGeneratorl abstract class. Within the GENIE event generation framework the structures 
containing a comprehensive set of instructions for generating a class of physics events are concrete 
EventGeneratorl objects. 

GENIE defines a comprehensive set of event generation threads responsible for generating event 
types at the level of fundamental interactions. The complete set of these event generation threads 
comprises GENIE's full 'physics content' for event generation. As an event generation thread can 
generate a single class of events only, there are usually multiple threads in use. 

The class of physics events generated by a thread can have an arbitrary granularity, from a single 
interaction corresponding to a particular process type with a given final state to very broad 
event categories. Each thread contains an InteractionList object, a container containing a list 
of the Interaction objects the thread can generate. The InteractionList plays a crucial role in 
identifying the responsibilities of each thread within the GENIE framework. Once an event type 
to be generated has been selected, a corre spon ding Interaction object is instantiated. Following 
the Chain of Responsibility design pattern |130l | , GENIE attempts to match the Interaction object 
with an element of the InteractionList containers for all active threads. The first thread found 
that is able to handle that event type is handed the responsibility to generate the event. 

Additionally, event generation threads include an instance of the cross section algorithm that 
can be used for selecting the event kinematics or for computing the probability for a particular 
neutrino to interact. This is another example of separating mechanics from models and serves to 
greatly simplify the dynamic mapping between event types and cross section models. 

Once a list of threads has been loaded into the generator, many high-level event generation op- 



erations became trivial. Compiling the list of all event types that can be generated by GENIE 
in its current configuration simply involves looping over the active threads and adding the cor- 
responding InteractionList objects. Selecting an event type to be generated from that master 
list involves looping over its Interaction objects and, for each element, identifying the responsible 
thread, requesting its corresponding cross section model and invoking it by passing the Inter- 
action object as argument. Once an event type has been selecting generating the event simply 
involves looking up the responsible thread and delegating responsibility to it. 

During event generation an invoked thread maintains a modification history of the event record. 
If a tried event generation path leads to a dead-end, the current event generation module throws 
an exception and aborts. The event generation thread catches that exception and, depending 
on information stored at it, may rerun the event using a snapshot of the event record taken N 
steps back, in the hopes of taking an alternative path and avoiding the encountered dead-end. 
If a configurable maximum number of exceptions is caught, or if any thrown exception specifies 
explicitly that generation of the current event is to be aborted altogether, the thread sets the 
appropriate error flags and makes sure that the remaining processing steps are skipped. The user, 
via options set in the event generation driver, may choose to keep certain types of these events so 
as to examine their type and frequency, though the default behavior of GENIE is to discard these 
events and only write out physical, fully formed events. Error handling within each active thread 
greatly adds to the robustness and fault-tolerance of the package, which is especially valued in 
large-scale, CPU-intensive, experiment-specific Monte Carlo production runs involving hundreds 
of CPU cores over many weeks. 

Advanced users can modify the default event generation threads by removing / adding event 
generation modules, or they can define their own uniquely named threads for handling new 
processes or handling existing processes in new ways. 

5.4-3. Event generation driver classes 

GENIE provides two event generation driver classes. These drivers collect the user inputs, in- 
stantiate and configure all required event generation components, and oversee communications 
between these components, the computing environment, and the user. 

The two driver classes support two different types of functionality: 

• Instances of the GEVGDriver class can handle event generation for a given initial state 
corresponding to an arbitrary neutrino / target pair. 

• Instances of the GMCJDriver class can be used for more complicated simulations involv- 
ing arbitrarily complex, realistic beam flux simulations and detector geometry descriptions. 
This driver object concerns itself mostly with driving the flux and detector geometry nav- 
igation drivers and integrating those with the GENIE event generation framework. It 
represents a significantly more complex and CPU-intensive event generation case but relies 
entirely on a host of GEVGDriver instantiations, one for each possible initial state in that 
Monte Carlo job. in order to obtain neutrino interaction physics modeling capabilities and 
generate event kinematics. 



6. GENIE Event Generation Applications and Utilities 

GENIE is being used by a host of precision-era neutrino experiments and provides off-the-shelf 
components for generating neutrino interactions under the most realistic assumptions. The event 
generation driver classes described in Section [5] are encapsulated within driver applications which 
present the user with a command-line or graphical interface, instantiate and configure those driver 
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Figure 6: A UML diagram depicting the GENIE event generation framework. See text for details. 



classes, call the event generation methods to generate the requested number of events, and push 
those events through a persistency manager. 

In experiment-specific GENIE-bascd event generation drivers utilizing the GMCJDriver one can 
integrate the GENIE neutrino interaction modeling with detailed flux and detector geometry de- 
scriptions. This is a non-trivial operational capability that older procedural neutrino generators 
typically lacked, requiring significant development effort from experiments. The flux descriptions 
are typically derived from experiment-specific beam-line simulations while the detector geome- 
try descriptions are typically derived from engineering drawings mapped into the GEANT4 



ROOT [l^l or GDML |132| geometry description languages. Obviously, flux and detector geome- 
try descriptions can take many forms, driven by experiment-specific choices. GENIE standardizes 
the geometry and fiux driver interfaces by defining the operations that GENIE needs to perform 
on the geometry and flux descriptions and the essential flux and geometry information needed 
for the generation of events. Concrete implementations of the interfaces allow experiments to 
extend GENIE's event generation capabilities and make it possible to seamlessly integrate new 
geometry descriptions and beam fluxes into user applications. 

In this section we will describe in some detail the flux and geometry interfaces. We will briefly 
describe applications built from these drivers as well as GENIE utilities to evaluate and display 
cross section information, make comparisons to external data, and facilitate model tuning. 



6.1. Neutrino flux drivers 



In GENIE every concrete flux driver implements the GF/zta;/ interface. The interface defines what 
neutrino flux information is needed by the event generation drivers and how that information is 
to be obtained. Each concrete flux driver includes methods to: 



• Declare the neutrino flavors that can generate events. This information is used for initial- 
ization purposes, in order to construct a list of all possible initial states in a given event 
generation run. 

• Declare the maximum energy. Again this information is used for initialization purposes, 
in order to calculate the maximum possible interaction probability in a given event gener- 
ation run. Since neutrino interaction probabilities arc tiny. GENIE scales all interaction 
probabilities in a particular event generation run so that the maximum possible interaction 
probability is 1. That maximum interaction probability corresponds to the total interac- 
tion probability (summed over nuclear targets and process types) for a maximum energy 
neutrino following a trajectory that maximizes the density-weighted path-lengths for each 
nuclear target in the geometry. GENIE adjusts the MC run normalization accordingly to 
account for this probability rcnormalization. 

• Generate a flux neutrino and specify its pdg code, its weight (if any), its 4-momentum and 
4-position. The 4-position is given in the detector coordinate system (as specified by the 
input geometry) . Each such flux neutrino is propagated towards the detector geometry but 
is not required to cross any detector volume. GENIE will take that neutrino through the 
geometry, calculate density-weighted path-lengths for all nuclear targets in the geometry, 
calculate the corresponding probabilty for scattering off each nuclear target and decide 
whether that flux neutrino should interact. If it interacts, an appropriate GEVGDriver will 
be invoked to generate the event. 

• Notify that no more flux neutrinos can be thrown. Flux drivers that use the output of beam- 
line simulations, so-called 'flux files', are configured to recycle these flux files multiple times 
in a given run since most neutrino fiux entries do not produce an interaction. The fiag 
allows GENIE to properly terminate the event generation run once this limit is reached, 
irrespective of the accumulated number of events, protons on target, or other metric of 
exposure. 



The above correspond to the common set of operations/information that GENIE expects to be 
able to perform/extract from all concrete flux drivers. Specialized drivers may define additional 
information that can be utilized in experiment-specific applications. One typical example of this 
is to 'pass-through' information about the fiux neutrino parents placed in the flux files by the 
beamline simulation, such as the parent meson PDG code, its 4-momentum, and its 4-position 
at the production and decay points. 

At the time of writing this article, GENIE already includes a host of concrete flux drivers allowing 
GENIE to be used in many realistic, experiment-specific situ ations. More specifically, it includes 



an interface to the JPARC neutrino beam simulation [l33l | used by Sup er-K amiokande |134l |. 



nd280 01, and INGRID and an interface to the NuMI beam simulation [l35t used by MINOS 



[l36t . NOvA B MINERvA @, MicroBooNE ^ and ArgoNEUT [Tlj. It includes drivers for 
the BGLRS |l37| and the FLUKA [iSSj atmospheric fluxes. Moreover, it includes a generic 
flux driver, describing a cylindrical neutrino flux of arbitrary 3-D direction and radius, with a 
configurable radial dependence, which can be used for describing a flux containing a number 
of different neutrino species whose (relatively normalized) energy spectra arc described as 1-D 
histograms. GENIE, obviously, also includes the trivial case of a monoenergctic fiux typically 
employed in physics benchmarking calculations. 



Concluding this section it is worth re-emphasizing the fact that new concrete flux drivers (describ- 
ing the neutrino flux from other beam-hnes) can be easily developed and they can be effortlessly 
and seamlessly integrated within the GENIE event generation framework. 

6.2. Geometry drivers 

In GENIE every concrete geometry driver implements the G eom Analyzer I mteTi&ce. The interface 
specifies what information about the input geometry is relevant to the event generation and how 
that information is to be obtained. Each concrete geometry driver implements methods to: 

• Declare the list of target nuclei that can be found in the geometry. This information is used 
for initialization purposes, in order to construct a list of all possible initial states in a given 
event generation run. 

• Compute the maximum density-weighted path-lengths for each nuclear target in the geom- 
etry. Again, this is information used for initialization purposes. The computed 'worst-case' 
trajectory is used to calculate the maximum possible interaction probability in a particular 
event generation run. This maximum interaction probability is used internally to normalize 
all computed interaction probabilities in that run. 

• Compute density-weighted path-lengths for all nuclear targets, for a trajectory of given 4- 
momentum and starting 4-position. This allows GENIE to calculate probabilities for each 
flux neutrino to be scattered off every nuclear target along its path through the detector 
geometry. 

• Generate a vertex along a trajectory of given 4-momentum and starting 4-position on a 
volume containing a given nuclear target. This allows GENIE to place a neutrino interaction 
vertex within the detector geometry once an interaction of a flux neutrino off a selected 
nuclear target has been generated. 

GENIE currently contains a concrete geometry driver able to handle the ROOT-based detector 
geometry descriptions typically used by most neutrino experiments. Detector geometry descrip- 
tions based on GEANT or GDML can be converted into ROOT-based descriptions and used by 
the same GENIE geometry driver as well. GENIE also includes a driver for more trivial geometry 
descriptions corresponding to a single nuclear target or a target mix (a set of nuclear targets, 
each with its corresponding weight fraction) at a flxed position. This simpler geometry driver 
may be used in simulating flxed initial states for benchmarking calculations or in experimental 
situations where a relatively uniform detector is being illuminated by a spatially uniform neu- 
trino beam. An example of the latter would be a detector placed far enough from the beam-line 
instrumentation so as to see a point-like neutrino source. 

Again it is worth re-emphasizing that any new detector geometry description can be seamlessly 
integrated with the GENIE event generation framework by means of developing an appropriate 
GENIE geometry driver. 

6.3. Event generation outputs 

The generated events are stored in the ROOT flle format. The typical output of an event 
generation run is a single ROOT flle which contains an event tree with a single branch and a 
single leaf per event containing the generated GHEP record. User-deflned branches to write 
out experiment-specific information may be added to that tree with the user-defined information 
'linked' to the corresponding generated neutrino event. The output ROOT flle contains directories 
storing all GENIE conflguration information for the MC run and a snapshot of the running 
environment for later reference. GENIE provides a persistency manager object which can be 
employed within the event generation driver applications to write out the event tree. 



6.4- Event generation applications 



GENIE is distributed with many event generation applications and utilities, many of which 
are straightforward wrappers for the components described above. Users interact with these 
applications through simple command-line interfaces, user-created XML configuration files, and 
environmental variables. 

The gevgen application can be used in simulating given initial states for benchmarking calcu- 
lations or simple experimental setups for which histogram-based flux descriptions and simple 
geometry descriptions in terms of a target mix are adequate. Experiment-specific event genera- 
tion applications, such as gT2Kevgen and gNuMIevgen, employ the detailed JPARC and NuMI 
beam-line simulations and the ROOT-based detector geometry descriptions of the corresponding 
experiments and are used by a large fraction of the GENIE user base. 

On a MacBookPro running MAC OS X 10.5.6, with a 2.16 GHz Intel Core 2 Duo and 1 GB 
DDR2 SDRAM at 667 MHz, and with all event generation threads enabled, GENIE simulates 
around 70 events/sec for a i/^ + Fe^^ initial state at E^, = IGeV. The speed is 5 events/sec 
with the detailed nd280 ROOT-based detector geometry description (40 nuclear targets) and the 
detailed JNUBEAM-based JPARC neutrino beam simulation (4 neutrino species). 

6.5. Utilities 

Event generation for a realistic experimental setup typically requires of the order of ^ 10^ — 10^ 
differential cross section evaluations just in order to select an interaction to be generated. It 
is therefore very practical to perform the numerical differential cross section integrations at a 
different stage and save the data for building cross section splines. The event generation com- 
ponents can recycle these splines for performing fast numerical interpolations, greatly improving 
the event generation efficiency. GENIE provides a utility, gmkspl, to generate all required cross 
section splines for the intended set of neutrino flavors and nuclear targets over the required energy 
range and write the data in the XML format expected by the event generation components. 

For many use-cases it is convenient to analyze the output GHEP ROOT event tree and either 
write-out simpler, flat n-ntuples containing summary information or convert it to a format ex- 
pected by an experiment-specific application, such as a detector-level simulation that doesn't use 
the GENIE I/O. GENIE provides an event tree converter, gntpc , which writes out a host of al- 
ternative plain text, XML or bare-ROOT formats currently is use by GENIE-based applications 
in client experiments. 

Several tools exist for the purposes of validating and tuning the physics models in GENIE. The 
ultimate s ourc e of data for many of these comparisons is the DURHAM Neutrino Scattering Data 
Resource jll4| , an online resource with large compilations of neutrino data from many diff erent 
experiments. Access to this data in the GENIE package is done through the NuValidator [ll5| 
program, a GENIE add-on that can be optionally installed. Distributed with the NuValidator in 
XML format are d ata from the DURHAM database, electron scattering data from t he Je fferson 
Lab database [ll2| . and publicly available lepton scattering structure function data Elec- 
tron scattering data and lepton structure function data arc important in evaluating any sc hem e 
for handling the perturbative / non-perturbative transition region described in Sec. [3XT][llli . 
This data is then available for a variety of applications. The NuValidator includes a simple GUI 
interface allowing one to select data to display together with the GENIE prediction, with full 
ability to set model parameters to new values. The data can also be accessed for physics model 
parameter fits possibly including new experimental data as well as external, historical data. 

In addition to event generation and validation the flexibility of the GENIE framework also sim- 
plifies many downstream analysis tasks. The evaluation of generator-related systematic errors 
can be greatly simplified through event reweighting |l40| . As mentioned in Sec. 15.21 GENIE's 
ability to allow algorithm configuration using local pools, redundancy of information between 



GHEP event records and Interaction objects, and ability to identify algorithms/configurations in 
generated output all support the development of experiment-specific event reweighting programs. 

Details on the application describ ed ab ove and on a host of other utilities may be found in the 
GENIE Physics and User Manual [llo| . 



7. Collaboration Organization 

A significant organizational challenge for the GENIE project is defining the working relationship 
with experimental collaborations. Having many experiments use the same neutrino event gener- 
ator is a new development in this field and the exact nature of these working relationships will 
evolve over time. However some realities and goals are clear. 

Good two-way communication is essential both for immediate issues like bug reporting and sup- 
port and for longer term issues like planning of upcoming physics releases. Experiments often 
set target dates to begin production of Monte Carlo samples based on publication plans and 
expectations for data-taking. Meeting these deadlines is often a high priority for these collabora- 
tions. Since the time-scale for production of large Monte Carlo samples is roughly similar to the 
timescale for production of new GENIE physics releases, it will be desirable to synchronize, or at 
least be cognizant of, upcoming experimental deadlines to as large a degree possible. Discussions 
with collaborations will also focus on priorities for physics model improvements. 

One conduit for these collaborations will be through experimental liaisons who serve as the main 
contact within an experiment on GENIE issues. This person can then report on the experiment's 
experiences and deadlines, and can help present the experiment's priorities for model improve- 
ments to the GENIE collaboration for evaluation. When effort within the GENIE collaboration to 
incorporate new model work is not available, these liaisons can assist in providing and organizing 
effort from their collaborations. 

Physics model development is partitioned into subtasks including the cross section model, the 
hadronization model, and the intranuclear rescattering model. These components are all rela- 
tively self-contained and have validation procedures independent of the others. Overall tuning 
and validation of a production release is the responsibility of a separate working group of the 
collaboration. This task is undertaken on a roughly yearly timescale. This exercise finalizes the 
model set and determines values of all parameters based on fits to external data. This process 
also determines parameter errors which can then be used by experiments in the evaluation of 
generator-related systematic errors. 

A default tune - a self consistent set of physics models and parameter values - will be specified 
by the collaboration for every major release, along with information about the uncertainties on 
model parameters and possible correlations. It is possible for experiments, using the validation 
and parameter fitting tools that are provided as part of the GENIE package, to have their own 
'tuning' of the generator. This would be desirable from an experiment's perspective as they 
have access to new data that, due to the aforementioned uncertainties, will probably not be in 
complete agreement with the default tuning. It is hoped that the development of new models by 
specific collaborations as part of ongoing analyses will be speedily adopted into GENIE for the 
benefit of other users, though the decision of when to make their new models public will be left to 
the discretion of the user collaborations. It will, however, be important that model improvements 
be merged back into the default tuning on a regular basis so as to prevent the fragmentation of 
the base set of physics models into many different experiment-tuned versions which then evolve 
independently over many years. 

The collaboration organizes its effort internally through occasionall meetings of physics model and 
tuning/validation working groups, phone meetings, blogs, and web-based document databases. 
The GENIE web site [l[ is the centr al rep ository for all information related to the pack age. An 



extensive Physics and User Manual [llOj, a web-based source code Reference Manual jl41j as 



well as a support mailing list are available to GENIE users. Hands-on tutorials on GENIE have 
been given on several occasions at meetings in the U.S., Europe, and Japan, and material from 
these workshops with introductory and advanced tutorial examples arc available on the GENIE 
web site. 



8. Code Availability 

8.1. Version control and distribution 

GENIE is available from its CVS repository hosted at STFC's Rutherford Appleton Laboratory 
from where one can access the development version and a series of 'frozen' production-quality 
releases. The repository is physically located on AFS space and, in read-only mode, can be 
accessed anonymously. Read-write mode access to the code repository is provided to the GENIE 
collaborators via SSH and public key authentication. A transition to a Subversion repository is 
planned for the near future. Up-to-date details on how to acce ss the source code are given at the 
GENIE web-site [H and at the Physics and User Manual [ll^. 



8.2. Supported platforms and external dependencies 

GENIE is known to build on many platforms, including all popular LINUX distributions and 
MAC OS X, and has no OS proprietary dependency. As of GENIE v2.5.1, external dependencies 
for a minimal installation that can be used for physics MC production include the ROOT Class 
Libraries the LHAPDF parton density function library [l4i|, the PYTHIA-6 LUND MC 
[l43t and two fairly common utilities: the libxml2 XML parser and the log^cpp error logger. 



8.3. Versioning scheme and release lifetime 

GENIE versions are numbered as H.j.k\ where i, j and k are the major, minor and revision 
indices respectively. The corresponding CVS tag is 'i? — i-jJt\ When a number of significant 
functionality improvements or additions have been made, the major index is incremented. The 
minor index is incremented in case of significant fixes or minor feature additions. The revision 
number is incremented for minor bug fixes and updates. Versions with even minor number 
correspond to validated, physics production releases. Versions with odd minor number correspond 
to the development version of 'candidate' releases tagged during the validation stage preceding a 
physics production release. Tagged versions always have an even revision number. Odd revision 
numbers correspond to the CVS head. 

Because of the effort invested by the experimental communities in generating large samples and 
understanding the impact of the simulation changes into their physics results, physics production 
releases are nominally supported for a long term of approximately 18 - 24 months. 

8.4- License 



GENIE is now distributed under the GPLv3 license agreement [144 1 



9. Conclusions 



GENIE provides a modern and versatile platform for a universal, 'canonical' Neutrino Interaction 
Physics Monte Carlo whose validity will extend to all nuclear targets and neutrino flavors over a 
wide range of energies from MeV to PeV scales. Currently, it includes state-of-the-art neutrino 
interaction physics modeling in the few-GeV energy range which is relevant for the current and 
near future long-baseline precision neutrino experiments using accelerator-made beams. 



The software was designed using object-oriented methodologies and developed entirely in C++ 
over a period of more than three years, from 2004 to 2007. The design of the package decouples 
the mechanics of event generation from the physics models, providing modularity, extensibility, 
and flexibility. The package supports the full life-cycle of generator-related activities, from event 
generation using detailed detector geometries and flux information, to final analysis tasks such as 
reweighting and systematic error evaluation. The data, programs, and procedures used to validate 
and tune the package are all distributed with the package itself, allowing users the ability to easily 
extend the package and evaluate new models. 

The project is supported by a group of physicists from all major experiments operating in this 
energy range, establishing GENIE as a major HEP event generator collaboration. GENIE has 
already been adopted by many neutrino experiments, including those using the JPARC and NuMI 
neutrino bcamlincs, and will be an important physics tool for the worldwide accelerator neutrino 
program. 
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